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Abstract. This paper considers the computational hardness of comput¬ 
ing expected outcomes and deciding (universal) (positive) almost-sure 
termination of probabilistic programs. It is shown that computing lower 
and upper bounds of expected outcomes is E°- and ^-complete, respec¬ 
tively. Deciding (universal) almost-sure termination as well as deciding 
whether the expected outcome of a program equals a given rational value 
is shown to be /^-complete. Finally, it is shown that deciding (universal) 
positive almost-sure termination is .E 2 -complete (TTf-complete). 
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1 Introduction 

Probabilistic programs [1] are imperative programs with the ability to toss a 
(possibly) biased coin and proceed their execution depending on the outcome of 
the coin toss. They are used in randomized algorithms, in security to describe 
cryptographic constructions (such as randomized encryption) and security ex¬ 
periments [2], and in machine learning to describe distribution functions that are 
analyzed using Bayesian inference [3]. Probabilistic programs are typically just 
a small number of lines, but hard to understand and analyze, let alone algorith¬ 
mically. This paper considers the computational hardness of two main analysis 
problems (and variations thereof) for probabilistic programs: 

1. Computing expected outcomes: Is the expected outcome of a program vari¬ 
able smaller than, equal to, or larger than a given rational number? 

2. Deciding [universal] (positive) almost-sure termination: Does a program ter¬ 
minate [on all inputs] with probability one (within an expected finite number 
of computation steps)? 

The first analysis problem is related to determining weakest pre-expectations 
of probabilistic programs [4,5]. Almost-sure termination is an active field of 
research [6]. A lot of work has been done towards automated reasoning for 
almost-sure termination. For instance, [7] gives an overview of some particu¬ 
larly interesting examples of probabilistic logical programs and the according 
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intuition for proving almost-sure termination. Arons et al. [8] reduce almost- 
sure termination to termination of non-deterministic programs by means of a 
planner. This idea has been further exploited and refined into a pattern-based 
approach with prototypical tool support [9]. 

Despite the existence of several (sometimes automated) approaches to tackle 
almost-sure termination, most authors claim that it must intuitively be harder 
than the termination problem for ordinary programs. To mention a few, Mor¬ 
gan [10] remarks that while partial correctness for small-scale examples is not 
harder to prove than for ordinary programs, the case for total correctness of a 
probabilistic loop must be harder to analyze. Esparza et al. [9] claim that almost- 
sure termination must be harder to decide than ordinary termination since for 
the latter a topological argument suffices while for the former arithmetical rea¬ 
soning is needed. The computational hardness of almost-sure termination has 
however received scant attention. As a notable exception, [11] establishes that 
deciding almost-sure termination of certain concurrent probabilistic programs 
is in II 2 *. 

In this paper, we give precise classifications of the level of arithmetical reason¬ 
ing that is needed to decide the aforementioned analysis problems by establishing 
the following results: We first show that computing lower bounds on the expected 
outcome of a program variable v after executing a probabilistic program P on 
a given input y is //[-complete and therefore arbitrarily close approximations 
from below are computable. Computing upper bounds, on the other hand, is 
shown to be //[(-complete, thus arbitrarily close approximations from above are 
not computable in general. Deciding whether an expected outcome equals some 
rational is shown to be //((-complete. 

For the second analysis problem—almost-sure termination—we obtain that 
deciding almost-sure termination of probabilistic program P on a given input y 
is //((-complete. While for ordinary programs we have a complexity leap when 
moving from the non-universal to the universal halting problem, we establish 
that this is not the case for probabilistic programs: Deciding universal a.s ter¬ 
mination turns out to be //^-complete too. The case for positive almost-sure 
termination is different however: While deciding (non-universal) positive almost- 
sure termination is //[(-complete, we show that universal positive almost-sure 
termination is //([-complete. 

2 Preliminaries 

As indicated, our hardness results will be stated in terms of levels in the arith¬ 
metical hierarchy—a concept we briefly recall: 

Definition 1 (Arithmetical Hierarchy [12,13]). For every n £ IN, the class 

is defined as = {A \ A = {x | 3y± \/y 2 3y 3 • • • 3/Vy n : (x, yi, y 2 , 2 / 3 , • • •, 
y n ) £ P}, Pisa decidable relation}, the class 17° is defined as 17° = {A | A = 
{x | Vj/i 3y 2 Vj /3 • • • 3/\/y n : ( x , 2 / 1 , y 2 , 2 / 3 , ■ • ■, 2 In) £ P) , P is a decidable rela¬ 
tion} and the class is defined as Z\° = //°nl/°. Note that we require that the 
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values of the variables are drawn from a recursive domain. Multiple consecutive 
quantifiers of the same type can be contracted to one quantifier of that type, 
so the number n really refers to the number of necessary quantifier alternations 
rather than to the number of quantifiers used. A set A is called arithmetical, 
iff A € P°, for r £ {77, n, A} and n £ N. The arithmetical sets form a strict 
hierarchy, i.e. Z\° C P° C A n+ 1 and 77° ^ 77° holds for T £ {77, 77} and n > 1. 
Furthermore, note that 77° = 77° = Z\° = Zl° is exactly the class of the decidable 
sets and 77° is exactly the class of the recursively enumerable sets. 

Next, we recall the concept of many-one reducibility and completeness: 

Definition 2 (Many One Reducibility and Completeness [13,14,15]). 

Let A, B be arithmetical sets and let X be some appropriate universe such that 
A,B C X. A is called many-one-reducible to B, denoted A. < m B, iff there 
exists a computable function f: X —> X, such that \f x £ X: (x £ A -€=>■ /( x) £ 
B). If f is a function such that f many-one reduces A to B. we denote this by 
/: A < m B. Note that < m is transitive. 

A is called r°-complete, for r £ {77, 77, A}, iff both A £ P° and A is 
r®-hard, meaning C < m A, for any set C € P°. Note that if A is T®-complete 
and A < m B, then B is necessarily T^-hard. Furthermore, note that if A is in¬ 
complete, then A £ 77°\77°. Analogously if A is 77° -complete, then A £ 77°\77°. 

3 Probabilistic Programs 

In order to speak about probabilistic programs and the computations performed 
by such programs, we briefly introduce the syntax and semantics we use: 

Definition 3 (Syntax). Let Var be the set of program variables. The set Prog 
of probabilistic programs adheres to the following grammar: 

Prog —» v := e | Prog; Prog | {Prog} [p] {Prog} | WHILE ( b ) {Prog} , 

where v £ Var, e is an arithmetical expression over Var, p £ [0, 1] C Q, and 
b is a Boolean expression over arithmetic expressions over Var. We call the set 
of programs that do not contain any probabilistic choices the set of ordinary 
programs and denote this set by ordProg. 

The presented syntax is the one of the fully probabilistic 1 fragment of the prob¬ 
abilistic guarded command language (pGCL) originally due to Mclver and Mor¬ 
gan [4], We omitted skip-, abort-, and if-statements, as those are syntactic 
sugar. While assignment, concatenation, and the while-loop are standard pro¬ 
gramming constructs, {Pi} \p\ {P 2 } denotes a probabilistic choice between pro¬ 
grams Pl (with probability p) and P 2 (with probability 1 — p). An operational 
semantics for pGCL programs is given below: 

1 Fully probabilistic programs may contain probabilistic but no non-deterministic 
choices. 
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Definition 4 (Semantics). Let the set of variable valuations be denoted by V = 
{77 | 77: Var — > Q + }, let the set of program states be denoted by S = (ProgU{ 4 -}) x 
Yx I x {L, R}*, for I = [0, 1] (~|Q + , let [e]^ be the evaluation of the arithmetical 
expression e in the variable valuation 77, and analogously let |6]|^ be the evaluation 
of the Boolean expression b. Then the semantics of probabilistic programs 
is given by the smallest relation hCSxS which satisfies the following inference 
rules: 


(assign) 

(concatl) 

(concat 2 ) 


(v := e, 77 , a, 9) b (|, 77 ^ max^e],,, 0 }], a , 9) 

(Pi, 77 , a, 9) b (P[, 77 ', a', 9') 

{Pi; Pi, V, a, 9) b (P {; P 2 , 77 ', a', 9') 


(I; P 2 , 77 , a, 9) b (P 2 , 77 , a, 9) 


(piobl) ^ {p 2 } 5 7?j a> Q) \- (P 1; 77 ,a-p,9-L) 

(prob2) (miMmi,77,a,0) b (p 2 ,^ a .(i-p),0.i?) 
f h1 n _ P4 = Tru e _ 

lW11 j (WHILE ( 6 ) {P}, 77 , a, 6») b (P; WHILE ( 6 ) {P}, 77 , a, 9) 

, , If61„ = False 

(while 2 -- 

(WHILE (fe) {P}, 77 , a, 0) b (|, 77 , a, 9) 

We use a b fc r in the usual sense. 

The semantics is mostly straightforward except for two features: in addition to 
the program that is to be executed next and the current variable valuation, each 
state also stores a sequence 9 that encodes which probabilistic choices were made 
in the past (Left or Pight) as well as the probability a that those choices were 
made. The graph that is spanned by the b-relation can be seen as an unfolding 
of the Markov decision process semantics for pGCL provided by Gretz et al. [5] 
when restricting oneself to fully probabilistic programs. 


4 Expected Outcomes and Termination Probabilities 

In this section we formally define the notion of an expected outcome as well the 
notion of (universal) (positive) almost-sure termination. We start by investigat¬ 
ing how state successors can be computed. 

It is a well-known result due to Kleene that for any ordinary program P and 
a state a the fc-th successor of a with respect to b is unique and computable. If, 
however, P is a probabilistic program containing probabilistic choices, the fc-th 
successor of a state need not be unique, because at various points of the execution 
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the program must choose a left or a right branch with some probability. However, 
if we resolve those choices by providing a sequence of symbols w over the alphabet 
{L, R} that encodes for all probabilistic choices which occur whether the Left 
or the Light branch shall be chosen at a branching point, we can construct a 
computable function that computes a unique fc-th successor. Notice that for this 
purpose a sequence of finite length is sufficient. We obtain the following: 

Proposition 1 (The State Successor Function). Let S_l = S U {_L}. There 
exists a total computable function T: IN x S x {L, R}* —> S_l, such that for k > 1 


rp / X _ I T ifw=£, 

°^’ ^ (_L, otherwise, 

I T k -i (r, w'), if a = (P, 77, a, 9) b (P', 77', a', 9 ■ b) = r, 

Tfc(cr, w) = < with w = b ■ w' and b G (L, P, e}, 

[_L otherwise. 

So Tfc(tr, w) returns a successor state r, if a b fe r, whereupon exactly |u;| infer¬ 
ences must use the (probl)- or the (prob 2 )- rule and those probabilistic choices 
are resolved according to w. Otherwise Tfc(cr, w) returns _L. Note in particular 
that for both the inference of a terminal state (f, 77, a, 9) within less than k steps 
as well as the inference of a terminal state through less or more than |to| prob¬ 
abilistic choices, the calculation of Tfc(<r, w ) will result in _L. In addition to T, 
we will need two more computable operations for expressing expected outcomes, 
termination probabilities, and expected runtimes: 

Proposition 2. There exist two total computable functions a: S_l —>• Q + and 
p: Sj_ x Var —>• Q + , such that 


a(a) 


a, if a= (I, _, a, _) 
0 , otherwise, 


p(er, v) 


V(v) ■a , if a= 4, 77 , a, _) 
0 , otherwise, 


where _ represents an arbitrary value. 

The function a takes a state a and returns the probability of reaching a. The 
function p takes a state tr and a variable v and returns the probability of reaching 
a multiplied with the value of v in the state a. Both functions do that only if 
the provided state <7 is a terminal state. Otherwise they return 0. Based on the 
above notions, we now definie expected outcomes, termination probabilities and 
expected times until termination: 

Definition 5 (Expected Outcome, Termination Probability, and Ex¬ 
pected Time until Termination). Let P € Prog, 77 G V, v G Var, ap rl = 
(P, 77 , 1, e), and for a finite alphabet A let A~ k = (J*L 0 A z . Then 

1 . the expected outcome of v after executing P on 77 , denoted Ep^(w), is 

OO 

E p, v (v) = ^2 51 p( T fe(o-p,r,, w), v) , 

fc=0 toe{L,fl}< fc 
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2. the probability that P terminates on rj, denoted Prp 5?7 (4-), is 

OO 

Prp,„(4.) = E E a(T k (a PtV , w)) , 

k —0 uig{L,_R}< fe 

3. the expected time until termination of P on 77 , denoted Ep ;rj (4-), is 

OO / 

Ep,,(4.) = E I 1 ~ E a ( T k(<rp,T), w)) 

fc =0 \ we{L,R}< k 

The expected outcome E p^(v) as defined here coincides with the weakest pre¬ 
expectation wp.P.v ( t)) a la Mclver and Morgan [4] for fully probabilistic pro¬ 
grams. In the above definition for E p >rt {v), we sum over all possible numbers of 
inference steps k and sum over all possible sequences from length 0 up to length 
k for resolving all probabilistic choices. Using p we filter out the terminal states 
a and sum up the values of p(a, v). 

For the termination probability PrpQ,), we basically do the same but we 
merely sum up the probabilities of reaching final states by using a instead of p. 

For the expected time until termination Ep jl; (j,), we go along the lines of [ 6 ]: 
It is stated there that the expected time until termination of P on 77 can be 
expressed as P r (“-P runs for more than k steps on 77 ”) = ]UpLo (l — Pr(“P 

terminates within k steps on 77 ”)). We have expressed the latter in our set-up. 

In order to investigate the complexity of calculating E p, v (v), we define three 
sets: C£XV , which relates to the set of rational lower bounds of Ep fl (w), TZ£XV, 
which relates to the set of rational upper bounds, and £XV which relates to the 
value of Ep ;I) (u) itself: 

Definition 6 (C£XV, 1ZSXT, and £XV). The sets C£XT, H£XV, £XV C 
Prog x ¥ x Var x Q + are defined as (P , 77 , v, q) G C£XV iff q < Ep^(u), 
(P, 77, v, q) G 1Z£XV iff q > E PtV (v), and (P, 77, v, q) G £XV iff q = E P ^(u). 

Regarding the termination probability of a probabilistic program, the case of 
almost—sure termination is of special interest: We say that a program P termi¬ 
nates almost-surely on input 77 iff P terminates on 77 with probability 1. Further¬ 
more, we say that P terminates positively almost-surely on 77 iff the expected 
time until termination of P on 77 is finite. Lastly, we say that P terminates uni¬ 
versally (positively) almost-surely, if it does so on all possible inputs 77 . The 
problem of (universal) almost-sure termination can be seen as the probabilistic 
counterpart to the (universal) halting problem for ordinary programs. 

In the following, we formally define the according problem sets: 

Definition 7 (Almost—Sure Termination Problem Sets). The sets ASP, 
VASP , IAASP , and VASP are defined as follows: 

(P, 77 ) G AST <£=> Prp.jjd) = 1 (P, 77 ) G VAST Ep,^(4) < 00 

P G UAST <^> v??: (P, 77 ) g AST P G UVAST V 77 : (P, 77 ) g VAST 

Notice that both VASP C ASP and UVASP C UASP hold. 
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5 The Hardness of Computing Expected Outcomes 


In this section we investigate the computational hardness of deciding the sets 
CSXV, 1ZEXV, and EXT. The first fact we establish is the ^-completeness of 
CSXV. This result is established by reduction from the (non -universal) halting 
problem for ordinary programs: 

Theorem 1 (The Halting Problem [16]). The halting problem is a subset 
n C ordProg x V, which is characterized as (P, g) £ TL iff 3k 3g': Tfc(<rp jr; , 
e) = (f, r?', 1, e). Let TL denote the complement of the halting problem, i.e. 
H — (ordProg x V) \ H. H is E® -complete and H is 71° -complete. 

Theorem 2. CSXV is E®-complete. 

Proof. For CSXV £ E® consider the following: 



Figure 1 (left) gives an intuition on this formula. 

It remains to show that CSXV is E®- hard: We do this by proving Tl < m 
CSXV. Consider the following function /: Tl < m CSXV: f takes an ordinary 
program Q £ ordProg and a variable valuation g as its input and computes 
(P, g, v, !/ 2 ), where P is the probabilistic program v := 0 ;{v : = 1}[V 2 ]{ TQ; v : = 
1} and TQ is an ordinary program that simulates Q on g. 

Correctness of the reduction: There are two cases: (1) Q terminates on input 
g. Then the expected outcome of variable v after executing the program P on 
input g is 1, because in both branches, the variable will be set to 1. As Y 2 < 1 , 
we have that (P, g , v, Y 2 ) £ CSXV. 

(2) Q does not terminate on input g. Then the expected outcome of variable 
v after executing the program P on input g is Y 2 i because only in the left branch 
(which has probability Y 2 ), the variable will be set to 1. In the right branch, the 
program does not terminate and therefore the outcome of this branch is 0. As 
Y 2 Y 2 > we have that (P, g, v, Y 2 ) ^ CSXV. □ 


Theorem 2 implies that CSXV is recursively enumerable. This means that all 
lower bounds for expected outcomes can be effectively enumerated by some algo¬ 
rithm. Now, if upper bounds were recursively enumerable as well, then expected 
outcomes would be computable reals. However, the contrary will be shown by 
establishing that 1ZEXV is T^-complete, thus 1ZEXV Y ^1 an d hence 1ZEXV 
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is not recursively enumerable, ^-hardness will be established by a reduction 
from the complement of the universal halting problem for ordinary programs: 

Theorem 3 (The Universal Halting Problem [16]). The universal halt¬ 
ing problem is a subset UH C ordProg, which is characterized as P G UH 
iffVg: (-P, v) G H. Let UH denote the complement of UH. i.e., UH = 
ordProg \UH. UH is II 2 -complete andUH is -complete. 

Theorem 4. TZEXV is E® -complete. 

Proof. For TZEXV G E° consider the following: 

( P , 77 , v, q) € TZEXV 
q > E PtV (v) 

OO 

~ *>E E p(Tfc(o\p i?; , w), v) 

fc=0 w€{L,R}< k 
V 

3 5 V y: q — 5 > E E p(T k {crp t n, w), v ) 

fc=0 w£{L,R}< k 

=k TZEXV G 

Figure 1 (right) gives an intuition on this formula. 

It remains to show that TZEXV is 172 -hard: We do this by proving UH < m 
TZEXV. Consider the following function f: UH < m TZEXV: f takes an ordinary 
program Q G ordProg as its input and computes the triple (P, 77 , v, 1), where 
77 is an arbitrary but fixed input, and P G Prog is the following probabilistic 
program: 

i : = 0; (c := 0} [0.5] (c := 1}; 

while (c / 0){ i := i + 1; {c := 0} [0.5] (c := 1} }; 
k : = 0; (c := 0} [0.5] (c := 1}; 

while (c 7 ^ 0){ k := k + 1; (c := 0} [0.5] (c := 1} }; 
v : = 0 ; TQ 

TQ is a program that computes a(T k((Q, 5 q (*)> 1 ; e ); £ )) ' 2 fc+1 and stores the 
result in the variable v. and gQ : IN —> V is some computable bijection, such that 
Vz G Var: ( gQ(i))(z ) ^ 0 implies that 2 occurs in Q. 

Correctness of the reduction: a(T k((Q, <?q(*)> 1, e), e)) • 2 fc+1 returns 2 fe+1 
if and only if Q halts on input gg(i) after exactly k steps (otherwise it returns 
0). The two while-loops generate independent geometric distributions with pa¬ 
rameter Y ‘2 on i and fc, respectively, so the probability of generating exactly the 
numbers i and k is 1 / 2 k+1 ■ 1 / 2 k+1 = i/ 2 i+fc+2 . The expected outcome of v after 
executing the program P on any input 77 is hence 

OO OO / \ 

EE 2i+fc+2 ■ a (T*((G> 9Q{i), 1, e). e) J ^ k+1 ■ 

i=0 k =0 ' ' 
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Fig. 1 . Schematic depiction of the formulae defining C£XV and TZ£XP, respectively. 
In each diagram, the solid line represents the monotonically increasing graph of 
ELo E w e{L,R}<k p(T fc (o-p,^, w), v ) plotted over increasing y. 


Since for each input, the number of steps until termination of Q is either unique 
or does not exist, the formula for the expected outcome reduces to EEo IFFt = 1 
if and only if Q halts on every input after some finite number of steps. Thus if 
there exists an input on which Q does not eventually halt, then (P, 77 , v, 1) £ 
1Z£XV as then the expected value is strictly less than one. If, on the other hand, 
Q does halt on every input, then the expected outcome is exactly one and hence 

(p, n, v, l) £ iz£xv. □ 

Finally, we establish the following result regarding exact expected outcomes: 
Theorem 5. £XV is n® -complete. 

Proof. For £XV £ II® consider the following: By Theorem 4, there exists a decid¬ 
able relation 1Z , such that (P, 77 , v, x) £ 1Z£XV iff 3 nVr 2 : (jt, r 2 , P, rj, v, x ) £ 
7Z. Furthermore from Theorem 2 it follows that there exists a decidable relation 
£, such that (P, 77 , v, x) £ C£XV iff 3£: (£, P, 77 , v, x) £ C. Let ->TZ and ->C be 
the (decidable) negations of TZ and C, respectively, then: 


(P, V, v, q) £ £XV 

q = Ep }V (v) 

q < E p, v (v) A q > Ep.,(w) 


- , (<7 > Ep^(u)) A -i (<7 < E p, v {v)) 

-■(3ri Vr 2 : (ri, r 2 , P, 77, v, q) £lZ) A ->(3^: {t, P, 77 , v, q) £ C) 
(Vr 1 3r 2 : (n, r 2 , P, 77 , v, q) £ -dl) A (W: (£, P, 77, u, < 7 ) € ->C) 
Vri 3r 2 : (n, r 2 , P, 77 , u, q) £ ^TZ A (£, P, 77 , u, g) £ —oC 


fTP £ P 2 ° 


It remains to show that fTP is I/^-hard. We do this by proving UP. < m £XV. 
Consider again the reduction function / from the proof of Theorem 4: Given an 
ordinary program Q , / computes the triple (P, 77 , u, 1), where P is a probabilistic 
program which has an expected outcome of one for the variable v if and only if Q 
terminates on all inputs, which is nothing else than Q £ UP. Thus /: UP < m 
£XV. □ 
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6 The Hardness of Deciding Probabilistic Termination 

This section presents the main contributions of this paper: Hardness results on 
several variations of almost—sure termination problems. We first establish that 
deciding almost—sure termination of a program on a given input is TIj-complete: 

Theorem 6 . AST is Ilf -complete. 

Proof. For AST £ Ilf we show AST < m £XV. For that, consider the following 
function /: AST < m £XV which takes a probabilistic program Q and an input 
7 / as its input and computes the tuple (P, 77 , v, 1), where v £ Var does not occur 
in Q and P is the probabilistic program v : = 0; Q ; v : = 1 

Correctness of the reduction: On executing P, the variable v is set to one 
only in those runs in which the program Q terminates on 77 . So the expected 
value of v converges to one, if and only if the probability of Q terminating 
converges to one. So if (Q, if) £ AST , then and only then (P, 77 , v, 1) £ £XV. 
Thus f: AST < m £XV . By Theorem 5, £XV is 772 _com Pl e te, so it follows 
directly that AST £ Ilf. 

It remains to show that AST is Ilf-hard. For that we many-one reduce the 
772 -com pl e te universal halting problem to AST using the following function 
/': UU T m .ASP'. f takes an 01 dinary program Cj as its input and computes 
the pair (P', 77 ), where 77 is some arbitrary but fixed input and P' is the following 
probabilistic program: 

i := 0; {c := 0} [0.5] {c := l}; 

while (c ^ 0){ i := i + 1; {c := 0} [0.5] {c := 1} }; 

SQ 

where SQ is an ordinary program that on any input 77 simulates the program 
Q on input gQ(i), and gQ: IN —> V is some computable bijection, such that 
Vu £ Var: ( 9 Q(i))(v ) ^ 0 implies that v occurs in Q. 

Correctness of the reduction: The while-loop in P' establishes a geometric 
distribution with parameter !/2 on i and hence a geometric distribution on all 
possible inputs for Q. After the while-loop, the program Q is simulated on the 
input generated probabilistically in the while-loop. Obviously then the entire 
program P' terminates with probability one on any arbitrary input 77 , i.e. termi¬ 
nates almost—surely on 77 , if and only if the simulation of Q terminates on every 
input. Thus Q £ UP. if and only if (P', 77 ) £ AST. □ 

While for ordinary programs there is a complexity leap when moving from the 
halting problem for some given input to the universal halting problem, we estab¬ 
lish that there is no such leap in the probabilistic setting , i.e. HAST is as hard 
as AST : 

Theorem 7. HAST is Ilf -complete. 

Proof. For showing HAST £ 11®, consider that by Theorem 6 there must exist 
a decidable relation 1Z such that (P, if) £ AST iff V y\ 3 772 : (iji , 2 / 2 , P, 77 ) £ 1Z. 
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By that we have that P £ UAST iff V77 Vt/i 3 y 2 : (y 1,2/2, P, V) € 1Z , which is 
a III? formula. It remains to show that UAST is TP? hard. This can be done 
by proving AST < m UAST as follows: On input ( Q , 77) the reduction function 
/: AST < m UAST computes a probabilistic program P that first initializes all 
variables according to 77 and then executes Q. □ 

We now investigate the computational hardness of deciding positive almost—sure 
termination: It turns out that deciding VAST is— although still undecidable— 
computationally more benign than deciding AST , namely it is P^-complete. 
Thus VAST becomes semi-decidable when given access to an P-oracle whereas 
AST does not. We establish £2 hardness by a reduction from WH. This result is 
particularly counterintuitive as it means that for each ordininary program that 
does not halt on all inputs, we can compute a probabilistic program that does 
halt within an expected finite number of steps. 

Theorem 8. VAST is ^-complete. 

Proof. For VAST £ £2 consider the following: 


(P, 77) € VAST 
00 > Ep^(|) 

3 c: c > E P ,„( 4 .) 

OO / 

3 c:c>£ 1 - £ a(T k (a P , v , w)) 

k—0 \ we{L,R}< k 

3 cV£: c> Eh- E a(T k (a Pt r), w)) 

k—0 \ w£{L,R}< k 

VAST € £$ 


It remains to show that VAST is JCl?- hard. For that we use a reduction function 
/: UH < m VAST with f(Q) = (P, 77), where 77 is arbitrary but fixed and P is 
the program 

c := 1; i := 0; x := 0; term := 0; InitQ; 


while (c 7^ 0){ 

StepQ ; 

if (term = 1){ 

Cheer ; i := i + 1; term := 0; InitQ 

}; 

{c := 0 } [ 0 . 5 ] {c := 1}; x := x + 1 

} , 

where InitQ £ ordProg is a program that initializes a simulation of the program 
Q on input <7 q(i) (recall the bijection gQ : IN — > V from Theorem 4 ), StepQ £ 
ordProg is a program that does one single (further) step of that simulation and 
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sets term to 1 if that step has led to termination of Q, and Cheer £ ordProg is a 
program that executes 2 X many effectless steps. In the following we refer to this 
as “cheering” 2 . 

Correctness of the reduction: Intuitively, the program P starts by simulating 
Q on input 5q(0). During the simulation, it—figuratively speaking—gradually 
looses interest in further simulating Q by tossing a coin after each simulation 
step to decide whether to continue the simulation or not. If eventually P finds 
that Q has halted on input §q{ 0), it “cheers” for a number of steps exponential 
in the number of coin tosses that were made so far, namely for 2 X steps. P then 
continues with the same procedure for the next input qq( 1 ), and so on. 

The variable x keeps track of the number of loop iterations (starting from 
0), which equals the number of coin tosses. The x-th loop iteration takes place 
with probability 1 / 2 X . One loop iteration consists of a constant number of steps 
Ci in case Q did not halt on input 3q(*) in the current simulation step. Such an 
iteration therefore contributes C i/ 2 X to the expected runtime of the probabilistic 
program P. In case Q did halt, a loop iteration takes a constant number of steps 
C 2 plus 2 X additional “cheering” steps. Such an iteration therefore contributes 
c 2 + 2 x / 2 * = cif 2 x + 1 > 1 to the expected runtime. Overall, the expected runtime 
of the program P roughly resembles a geometric series with exponentially de¬ 
creasing summands. However, for each time the program Q halts on an input, a 
summand of the form C 2 / 2 X + 1 appears in this series. There are now two cases: 

(1) Q £ Ul-t, , so there exists some input p with minimal i such that gq(i) = p 
on which Q does not terminate. In that case, summands of the form ° 2 / 2 x + 1 
appear only i — 1 times in the series and therefore, the series converges—the 
expected time until termination is finite, so (P, if) £ VAST. 

(2) Q ^ UP , so Q terminates on every input. In that case, summands of 

the form c 2 / 2 X + 1 appear infinitely often in the series and therefore, the series 
diverges—the expected time until termination is infinite, so (P, if) fL VAST■ □ 

The final problem we study is universal positive almost-sure termination. In 
contrast to the non positive version, we do have a complexity leap when moving 
from non universal to universal positive almost-sure termination. We will estab¬ 
lish that UVAST is Tig-complete and thus even harder to decide than UAST■ 
For the reduction, we make use of the following Tig-complete problem: 

Theorem 9 (The Cofmiteness Problem [16]). The cofiniteness problem 

is a subset COT C ordProg, which is characterized as P £ COT iff {p | (P, if) £ 
TT} is cofinite. Let COT denote the complement of COT, i.e. COT = ordProg\ 
COT. COT is Tg -complete and COT is Tig -complete. 

Theorem 10. UVAST is Il^-complete. 

Proof. By Theorem 8, there exists a decidable relation 1Z , such that (P, p) £ 
VAST iff 3yi Vy 2 : ( 7 / 1 , y 2 , P, 77 ) £ 1Z. Therefore UVAST is definable by P G 
UVAST iff V 77 3 yi V t / 2 : (t/i, j/ 2 , P, p) £ 1Z which gives a Tig-formula. 

The program P cheers as it was able to prove the termantion of Q on input Pq(*). 


2 
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It remains to show that UVAST is Tig-hard. For that we many-one reduce 
the Tig-complete complement of the cofiniteness problem to UVAST using the 
following function /: COT < m UVAST : / takes an ordinary program Q as its 
input and computes the following probabilistic program P: 

c := 1; x := 0; term := 0; InitQ; 
while (c ^ 0){ 

StepQ; if (term = l){C7ieer; i := i + 1; term := 0; InitQ} 
x := x + 1; {c := 0} [0.5] {c := 1} 

}. 


where InitQ is a program that initializes a simulation of the program Q on input 
gQ{i) (recall the bijection qq \ IN —» V from Theorem 4 ), StepQ is a program 
that does one single (further) step of that simulation and sets term to 1 if that 
step has led to termination of Q , and Cheer is a program that executes 2 X many 
effectless steps. 

Correctness of the reduction: The only difference to the program from the 
proof of Theorem 8 is that the variable i is not initialized with 0. Thus, on input 
r] the program P also simulates Q succesively on all inputs but starting from 
input gQ(g{i )) instead of gq(0). 

The problem COT can alternatively be defined as Q £ COT iff {?? | ( Q , if) £ 
H} is infinite. There are now two cases: 

(1) Q £ COT. Thus, there are infinitely many inputs on which Q does not 
terminate. So no matter with which number g(i) £ N the variable i is initialized, 
the variable i will eventually be incremented to some value j such that Q does 
not terminate on gg(j) and therefore, in the while-loop the if -branch with 
the “cheering” steps will eventually not be executed anymore. Consequently, 
the expected time until termination of P on any input g is finite and therefore 

P £ UVAST. 

(2) Q ^ COT. Then there are only finitely many inputs on which Q does 
not terminate. Say j is minimal such that Q does not terminate on input c/q(j), 
i.e. the program Q terminates on all other inputs 3 q(T / ) with j < i'. So when 
running the program P on some input g with g(i) > j, in the while-loop the if 
branch with the “cheering” steps will be executed infinitely often. Consequently, 
the expected time until termination of P on that input g is infinite and therefore 

P <£ UVAST. □ 

7 Conclusion 

We have studied the computational complexity of solving a variety of natural 
problems which appear in the analysis of probabilistic programs: Computing 
lower bounds, upper bounds, and exact expected outcomes (CSXV, IZSXV , and 
SXV), deciding non-universal and universal almost-sure termination ( AST and 
UAST ), and deciding non-universal and universal positive almost-sure termina¬ 
tion (VAST and UVAST). Our complexity results are summarized in Figure 2. 




14 


Benjamin Lucien Kaminski, Joost-Pieter Katoen 



not 

semi—decidable; 
even with 
access to 
IA1-L —oracle 


not 

semi—decidable; 
even with 
access to 
?■£— oracle 


Fig. 2. The complexity landscape of determining expected outcomes and deciding (uni¬ 
versal) (positive) almost-sure termination. 


All examined problems are complete for their respective level of the arithmetical 
hierarchy. 

Future work consists of identifying program subclasses for which some of the 
studied problems become easier. One idea towards this would be to investigate 
the use of quantifier-elimination methods such as e.g. Skolemization. 
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